Beyond Zipf’s Law: The Lavalette Rank Function and Its Properties
Authors
Abstract
Although Zipf's law is widespread in natural and social data, one often encounters situations where one or both ends of the ranked data deviate from the power-law function. We previously proposed the Beta rank function to improve the fit to data that do not follow a perfect Zipf's law. Here we show that when the two parameters of the Beta rank function have the same value, which gives the Lavalette rank function, the probability density function can be derived analytically. We also show, both computationally and analytically, that the Lavalette distribution is approximately equal, though not identical, to the lognormal distribution. We illustrate the utility of the Lavalette rank function on several datasets. We also address three analysis issues: statistical testing of the Lavalette fitting function, comparison between Zipf's law and the lognormal distribution through the Lavalette function, and comparison between the lognormal distribution and the Lavalette distribution.
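As a rough illustration of the rank functions named in the abstract, the sketch below assumes the form commonly given for the Beta rank function, f(r) = C (N + 1 - r)^b / r^a, where r is the rank and N the number of ranked items; setting a = b gives the Lavalette rank function. The abstract does not state this formula, so the form and the names c, a, b and n are assumptions, and the log-linear least-squares fit is a simple stand-in rather than the authors' fitting procedure.

```python
import numpy as np


def beta_rank(r, n, c, a, b):
    """Assumed Beta rank function: c * (n + 1 - r)**b / r**a."""
    r = np.asarray(r, dtype=float)
    return c * (n + 1 - r) ** b / r ** a


def lavalette_rank(r, n, c, b):
    """Lavalette rank function: the Beta rank function with a == b."""
    return beta_rank(r, n, c, b, b)


def fit_lavalette(values):
    """Fit log f(r) = log c + b * log((n + 1 - r) / r) by least squares.

    This is an illustrative log-linear regression, not necessarily the
    fitting method used in the paper. All values must be positive.
    """
    x = np.sort(np.asarray(values, dtype=float))[::-1]  # rank 1 = largest value
    n = x.size
    r = np.arange(1, n + 1)
    slope, intercept = np.polyfit(np.log((n + 1 - r) / r), np.log(x), 1)
    return np.exp(intercept), slope  # estimates of (c, b)


# Synthetic check: generate exact Lavalette data and recover the parameters.
n = 200
ranks = np.arange(1, n + 1)
data = lavalette_rank(ranks, n, c=50.0, b=0.8)
c_hat, b_hat = fit_lavalette(data)
print(c_hat, b_hat)
```

Under this assumed form, log f(r) is linear in log((N + 1 - r) / r), which is why a single straight-line fit recovers both parameters; on real data the fit would only be approximate and should be checked against the tails.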
Similar resources
Zipf’s Law Revisited
Zipf’s law states that the frequency of occurrence of some event as a function of its rank is a power-law function. Using empirical examples from different domains, we demonstrate that at least in some cases, increasingly significant divergences from Zipf’s law are registered as the number of events observed increases. Importantly, one of these cases is word frequency in a corpus of natural lang...
A Comparative Analysis of Gibrat’s and Zipf’s Law on Urban Population
The regional economics and geography literature on urban population size has in recent years shown interesting conceptual and methodological contributions on the validity of Gibrat’s Law and Zipf’s Law. Despite distinct modeling features, they express similar fundamental characteristics in an equilibrium situation. Zipf’s law is formalized in a static form, while its associated dynamic process ...
Zipf's law and L. Levin's probability distributions
Zipf’s law in its basic incarnation is an empirical probability distribution governing the frequency of usage of words in a language. As Terence Tao recently remarked, it still lacks a convincing and satisfactory mathematical explanation. In this paper I suggest that at least in certain situations, Zipf’s law can be explained as a special case of the a priori distribution introduced and studied...
Zipf's law emerges asymptotically during phase transitions in communicative systems
Zipf’s law predicts a power-law relationship between word rank and frequency in language communication systems, and is widely reported in texts yet remains enigmatic as to its origins. Computer simulations have shown that language communication systems emerge at an abrupt phase transition in the fidelity of mappings between symbols and objects. Since the phase transition approximates the Heavis...
Zipf’s law in Multifragmentation
We discuss the meaning of Zipf’s law in nuclear multifragmentation. We remark that Zipf’s law is a consequence of a power-law fragment size distribution with exponent τ ≃ 2. We also recall why the presence of such a distribution is not a reliable signal of a liquid-gas phase transition. PACS number(s): 25.70.Pq, 05.70.Jk, 64.60.Ak. The search for reliable signatures of the liquid-gas phase transit...